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Abstract 

The problem of optimum watermark embedding and detection was addressed in a recent paper by Merhav and 
Sabbag, where the optimality criterion was the maximum false-negative error exponent subject to a guaranteed false- 
positive error exponent. In particular, Merhav and Sabbag derived universal asymptotically optimum embedding and 
detection rules under the assumption that the detector relies solely on second order joint empirical statistics of the 
received signal and the watermark. In the case of a Gaussian host signal and a Gaussian attack, however, closed-form 
expressions for the optimum embedding strategy and the false-negative error exponent were not obtained in that work. 
In this paper, we derive such expressions, again, under the universality assumption that neither the host variance nor 
the attack power are known to either the embedder or the detector. The optimum embedding rule turns out to be 
very simple and with an intuitively-appealing geometrical interpretation. The improvement with respect to existing 
sub-optimum schemes is demonstrated by displaying the optimum false-negative error exponent as a function of the 
guaranteed false-positive error exponent. 

Index Terms 

Watermarking, watermark embedding, watermark detection, hypothesis testing, Neyman-Pearson. 



P. Comesana is with the Signal Theory and Communications Department, University of Vigo, Campus Lagoas-Marcosende, Vigo 36310, 
Spain, (phone: +34 986 812683, fax: +34 986 812116, e-mail: pcomesan@gts.tsc.uvigo.es), N. Merhav is with the Department of Electrical 
Engineering, Technion - I.I.T., Haifa 32000, Israel, (phone/fax: +972-4-8294737, e-mail: merhav@ee.technion.ac.il), M. Barni is with the 
Department of Information Engineering, University of Siena, Via Roma 56, Siena 53100, Italy, (phone: +39 0577 234624 / +39 0577 234621, 
fax: +39 0577 233630, e-mail: barni@dii.unisi.it). 

This work was partially supported by the Italian Ministry of Research and Education under FIRB project no. RBIN04AC9W. 



March 27, 2008 



DRAFT 



2 



I. Introduction 

About a decade ago, the community of researchers in the field of watermarking and data hiding has learned about 
the importance and relevance of the problem of channel coding with non-causal side information at the transmitter 
Q], and in particular, its Gaussian version - writing on dirty paper, due to Costa [2], along with its direct applicability 
to watermarking, cf. J5), (4). Costa's main result is that the capacity of the additive white Gaussian noise (AWGN) 
channel with an additional independent interfering signal, known non-causally to the transmitter only, is the same 
as if this interference was available at the decoder as well (or altogether non-existent). When applied in the realm 
of watermarking and data hiding, this means that the host signal (playing the role of the interfering signal), should 
not be actually considered as additional noise, since the embedder (the transmitter) can incorporate its knowledge 
upon generating the watermarked signal (the codeword). The methods based on this paradigm, usually known as 
side-informed methods, can even asymptotically eliminate (under some particular conditions) the interference of the 
host signal, that was previously believed to be inherent to any watermarking system. 

Ever since the relevance of Costa's result to watermarking has been observed, numerous works have been 
published about the practical implementation of the side-informed paradigm for the so-called multi-bit watermarking 
[4], [5 1, [6|, [7| case, where the decoder estimates the transmitted message among many possible messages. Far less 
attention has been devoted, however, to the problem of deciding on the presence or absence of a given watermark 
in the observed signal. In fact, in most of the works that deal with this binary hypothesis testing problem, usually 
known as one-bit (a.k.a. zero-bit) watermarking, the watermarking displacement signal does not depend on the 
the host JU, JU, iflOl . ifTTl . lfl2l that then interferes with the watermark, thus contributing to augment the error 
probability. To the best of our knowledge, exceptions to this statement are the works by Cox et al. Q, fl3l . Liu 
and Moulin fl4l . Merhav and Sabbag 1151 and Furon 1161 . In the next few paragraphs, we briefly describe the main 
results contained in these works. 

Cox et. al. i l7?l/ : In (3), Cox et. al. introduce the paradigm of watermarking as a coded communication system 
with side information at the embedder. Based on this paradigm, and by considering a statistical model for attacks, 
the authors propose a detection rule based on the Neyman-Pearson criterion. The resulting detection region is 

igt mi 

replaced by the union of two hypercones; mathematically, this detection rule is given by pirpiT] ^ 7 ~( Q! )> where 
s is the received signal, u is the watermark, s* is the transpose of s, s* • u is the inner product of s and u, a is 
the maximum allowed false-positive probability, and r(a) is the decision threshold, which is a function of a. In a 
successive paper [13], Miller et al. also compare the performance of the strategy of O to other typical embedding 
strategies. No attempt is made to jointly design the optimum embedding and detection rules. 

Liu and Moulin ^7?]/ : In fl4l . both false-positive and false-negative error exponents are studied for the one-bit 
watermarking problem, both for additive spread spectrum (Add-SS) and a quantization index modulation (QIM) 
technique [4]. The constraint on the embedding distortion is expressed in terms of the mean Euclidean norm of the 
watermarking displacement signal, and the non-watermarked signal is also assumed to be attacked (with attacks 
that impact the false-positive error probability). For Add-SS, exact expressions of the error exponents of both 
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false-positive and false-negative probabilities are derived. For QIM, the authors provide bounds only. These results 
show that although the error exponents of QIM are indeed larger than those obtained by public Add-SS (where the 
host signal is not available at the detector), they are still smaller than those computed for private Add-SS (where 
the host signal is also available at the detector). This seems to indicate that the interference due to the host is not 
completely removed. 

Merhav and Sabbag H15V : In lfT31 . the problem of one-bit watermarking is approached from an information- 
theoretic point of view. Optimum embedders and detectors are sought, in the sense of minimum false-negative 
probability subject to the constraint that the false-positive exponent is guaranteed to be at least as large as a 
given prescribed constant A > 0, under a certain limitation on the kind of empirical statistics gathered by the 
detector. Another feature of the analysis in fT31l is that the statistics of the host signal are assumed unknown. The 
proposed asymptotically optimum detection rule compares the empirical mutual information between the watermark 
u and the received signal y to a threshold depending on A. In the Gaussian case, this boils down to thresholding the 
absolute value of the empirical correlation coefficient between these two signals. Merhav and Sabbag also derive the 
optimal embedding strategy for the attack-free case and derive a lower bound on the false-negative error exponent. 
Furthermore, the optimization problem associated with uptimum embedding is reduced to an easily implementable 
2D problem yielding a very simple embedding rule. In that paper, Merhav and Sabbag study also the scenario 
where the watermarked signal is attacked. In this case, however, closed-form expressions for the error exponents 
and the optimum embedding rule are not available due to the complexity of the involved optimizations. 

Furon j [/6l/ : In 11161 . Furon uses the Pitman-Noether theorem ifTTl to derive the form of the best detector for a given 
embedding function, and the best embedding function for a given detection function. By combining these results, 
a differential equation is obtained, that the author refers to as the fundamental equation of zero-bit watermarking. 
Furon shows that many of the most popular watermarking methods in the literature can be seen as special cases 
of the fundamental equation, ranging from Add-SS, multiplicative spread spectrum, or JAMS [18|, to a two-sheet 
hyperboloid, or even combinations of the previous techniques with watermarking on a projected domain lfl9l . 
or watermarking based on lattice quantization. Compared with the framework introduced in fl5l . two important 
differences must be highlighted: 

• In 1161 . the watermarking displacement signal is constrained to be a function of the host signal which is 
scaled to yield a given embedding distortion. This means that in this set-up the direction of the watermarking 
displacement signal can not be changed as a function of the allowed embedding distortion. 

• One of the conditions that must be verified in order to apply the Pitman-Noether theorem is that the power of 
the watermarking displacement signal goes to zero when the dimensionality increases without bound. In fact, 
Furon hypothesizes that this is the reason why neither the absolute normalized correlation nor the normalized 
correlation are solutions of the fundamental equation. 

In this paper, we extend the results of 1T31 and derive a closed-form expression for the optimum embedding 
and detection strategies in the Gaussian set-up, that is, for a Gaussian host signal and a Gaussian attack channel. 
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As in 1 15 1, we assume that the embedder and the detector do not know the variance of the host signal and that of 
the noise added by the attacker. We also share with |[T5l the assumption that the detector is of limited resources, 
specifically, that it relies only on the Euclidean norm of the received signal and the empirical correlation between the 
received signal and the watermark. We derive explicit embedding and detection rules and establish their asymptotic 
optimality in the Neyman-Pearson sense of maximizing the false-negative error exponent for a given guaranteed 
false-positive error exponent. We also derive a closed-form expression for the false-negative error exponent. The 
optimum embedding strategy turns out to be very simple, and this opens the door to the development of new practical 
watermarking schemes for real-life signals like images, video or audio signals. The improved performance of the 
new scheme is demonstrated both theoretically, by comparing the achieved error exponents and those achieved by 
previous methods, and numerically, by displaying graphs of the error exponent functions. 

The remaining part of the paper is organized as follows: In Section HU we introduce notation conventions and 
formalize the problem. In Section [HI] an asymptotically optimum detection region is derived. In Section [IV] we 
use it to compute the false-negative error exponent, whose optimization is considered in Section [V] to derive 
a corresponding optimum embedder. In Section |VI| the optimum embedder and the exact false-negative error 
exponent for the noiseless case are introduced as a by-product of this analysis and compared to previous results 
in the literature. Finally, the main results of this work are summarized in Section IVTA where some suggestions for 
future research are also outlined. 

II. Notation and Problem Formulation 

Throughout the sequel, we denote scalar random variables by capital letters (e.g., V), their realizations with 
corresponding lower case letters (e.g., v), and their alphabets, with the respective script font (e.g., V). The same 
convention applies to n-dimensional random vectors and their realizations, using bold face fonts (e.g., V, v). The 
alphabet of each corresponding n-vector will be taken to be the n-th Cartesian power of the alphabet of a single 
component, which will be denoted by the alphabet of a single component with a superscript n (e.g.,V"). The z-th 
component of a vector V is denoted Vi. The probability law of a random vector V is described by its probability 
density function (pdf) /y(v), or its probability mass function (pmf) Py(V = v), depending on whether it is 
continuous or discrete, respectively. 

Let u and x, both n— dimensional vectors, be the watermark sequence and the host sequence, respectively. While 
Ui, i = 1, . . . , n, the components of u, take on binary values in « = {-l,+l}fl the components of x, namely, Xi, 
i = 1, . . . , n, take values in X = ]R. The embedder receives x and u, and produces the watermarked sequence y, 
yet another n-dimensional vector with components in y — H. We refer to the difference signal w = y — x as the 
watermarking displacement signal. The embedder must keep the embedding distortion e?(x, y) = ||y — x|| 2 within 
a prescribed limit, i.e., d(x, y) < nD, where D > is the maximum allowed distortion per dimension, uniformly 
for every x and u. 

'The basic derivations of this work will remain valid for different choices of U. 
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The output signal of the transmitter may either be the unaltered original host x, in the non-watermarked case, 
or the vector y, in the watermarked case. In both cases, this output signal is subjected to an attack, which yields 
a forgery signal, denoted by s. The action of the attacker is modeled by a channel, which is given in terms of a 
conditional probability density of the forgery given the input it receives, W(s|x) - in the non-watermarked case, 
or W(s|y) - in the watermarked case. For the sake of convenience, we define z as the noise vector added by 
the attacker, i.e., the difference between the forgery signal s and the channel input signal, which is the transmitter 
output (x or y, depending on whether the signal is watermarked or not). We assume that z is a Gaussian vector 
with zero-mean, i.i.d. components, all having variance a 2 z . 

The detector partitions H™ into two complementary regions, A (a.k.a. the detection region) and A c . If s E A, 
the detector decides that the watermark is present, otherwise it decides that the watermark is absent. We assume 
that the detector knows the watermark u, but does not know the host signal x (blind or public watermarking). The 
design of the optimum detection region for the attack-free case was studied in 115L and it is generalized to the 
case of Gaussian attacks in Section Hill 

The performance of a one-bit watermarking system is usually measured in terms of the tradeoff between the 
false positive probability of deciding that the watermark is present when it is actually absent, i.e., 

Pf, = /ds • [2.(4 + 4)]- /2 • exp {-Jf^) } (1) 

A 

and the false negative probability, of deciding that the watermark is absent when it is actually present, i.e., 

P fn = |ds |dx. (2^)-«/2 • exp {-^P} • (2 7 r4)-«/ 2 . exp {- l|S ~^' U) " 2 } , (2) 

A" JR n 

where / is the embedding function, that is, y = /(x, u). As n grows without bound, these probabilities normally 
decay exponentially. The corresponding exponential decay rates, i.e., the error exponents, are defined as 

Ef p = lim --In Pfp, (3) 
n-+oo n 

E f n 4 lim -~ In P fn . (4) 

7WOO n 

The aim of this paper is to devise a detector as well as an embedding rule for a zero-mean, i.i.d. Gaussian host 
with variance o\ and a zero-mean memoryless Gaussian attack channel with noise power a\, where the detector is 
limited to base its decision on the empirical energy of the received signal and its empirical correlation with u. Both 
o\ and a% are assumed unknown to the embedder and the detector. We seek optimum embedding and detection 
rules in the sense of uniformly maximizing the false-negative error exponent, Ef n , (across all possible values of 
o\ and a 2 z ) subject to the constraint that Ef p > A, where A is a prescribed positive real. 
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III. Optimum Detection and Embedding 

In A3! , an asymptotically optimum detector is derived for the discrete case and for the continuous Gaussian 
case. In the latter case, it is shown that if the detector is limited to base its decision on the empirical energy of the 
received signal, — X)™=i s f > an( ^ ^ ts empirical correlation with the watermark, — Y^i=i u i s i> tnen an asymptotically 
optimum decision strategy, in the above defined sense, is to compare the (Gaussian) empirical mutual information, 
given by: 



I US (U;S) = ~ In 



\n J2i=l u i s i) 



to A, or equivalently, to compare the absolute normalized correlation 

I i ^ n I 
I n Z^i=l U i S i | 



1 y-« 2 



| pus | 



n 2-ai=\ °i 



(5) 



(6) 



to a/1 - < 



-2A 



i.e., the detection region is the union of two hypercones, around the vectors u and u, with a spread 



depending on A. This decision rule of thresholding the empirical mutual information, or empirical correlation, is 
intuitively appealing since the empirical mutual information is an estimate of the degree of statistical dependence 
between two data vectors 

For the present setting, we have to extend the analysis to incorporate the Gaussian attack channel. But this turns 
out to be staightforward, as in the non-watermarked case (pertaining to the false-positive constraint), s continues 
to be Gaussian - the only effect of the channel is in changing its variance, which is assumed unknown anyhow. 
Thus, the same detection rule as above continues to be asymptotically optimum in our setting as well. 

Before we proceed to the derivation of the optimum embedder, it is instructive to look more closely at the 
dependence of the detection region on the false-positive exponent A. As mentioned earlier, the choice of A imposes 
a threshold that must be compared with © in order to provide the detector output. This is equivalent to establishing 
the limit angle of the detection region, that we will denote by (3 — arccos(Vl — e~ 2X ) = arcsin(e _A ) € [0, tt/2]. 
Letting 9 — arccos(pus), we then have: 



7p 



= Pi{O<0 < /3\H }+Pt{tv- p <6 <tt\H } 



2Pr{0 <0< p\H } 



2A n {P) 



a n ln(sin (3) 



(7) 



A n {n) 

where the notation = stands for equality in the exponential scale as a function of n[] and where A n (6) is the 
surface area of the 71-dimensional spherical cap cut from a unit sphere centered in the origin, by a right circular 
cone of half angle 9. In (|7), we used the fact that in the non-watermarked case, where s is a zero-mean Gaussian 



2 lt is also encountered in the literature of universal decoding the maximum mutual information (MMI) decoder for unknown memoryless 
channels. 

~ More precisely, if {ctn} and {fon} are two positive sequences, a n = b n means that lim„^oo ^ log = 0. 
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vector with i.i.d. components, independent of u, the normalized vector s/||s|| is uniformly distributed across the 
surface of the n-dimensional unit sphere, as there are no preferred directions. The exact expression of A n (8) is 
given by: 



A n (0) = 



(n-l)^- 1 )/ 2 

r(f) 



Sill 



(«-2) 



(<p)d(p. 



IV. The False-Negative Exponent 

In this section, we make the first step towards the derivation of the optimum embedding strategy. In particular, 
we compute the false-negative error exponent as a function of the watermarking displacement signal w, which is 
represented by a three-dimensional vector v = (ui,,i>2, U3). The vector v is the vector w, normalized by y/n, and 
transformed to the coordinate system pertaining to the linear subspace spanned by u, x and w. This result will be 
used later to derive the optimal embedding function subject to the distortion constraint, that limits the norm of w 



v 2 < D. To this end, we establish the following 



not to exceed nD, which corresponds to the constraint v\ + 1 
theorem. 

Theorem 1: Let Pf p , Pf n and their corresponding error exponents Ef p and Ef n , be defined as in eqs. {T),©,® 



and (0|, respectively. Let v = (vi,V2,va) € H 3 be given, and let A = {s : > 1 



-2A 



}. Then, 



Ef n 



mm mm mm 

qe[max(0,Ti(r,a,V)),oo) rG[0,oo) ae[-7r/2,7r/2] 



hi 



r 



In 



where 



Ti(r, a, v) = (\/rsma- 



•x 



ln(cos a) 



(8) 



1 



(v^ 



cos a 



V2) 



cos 2 P 

Proof. For convenience, let us apply the Gram-Schmidt orthogonalization procedure to the vectors u, x and w, 
and then select the remaining n — 3 orthonormal basis functions for M" in an arbitrary manner. After transforming 
to the resulting coordinate system, the above vectors have the forms u = (y/n, 0, 0, . . . , 0), x = (xi,X2,Q, . . . , 0), 
w = (tui, W2, W3, 0, . . . , 0) and y = (xi+wi, X2+W2, W3, 0, . . . , 0), while all the components of the noise sequence 
z will remain, in general, non-null. From ©, the false-negative event occurs whenever 

(xi + wi + zi) 2 



(Xl + Wl + Zi) 2 + (X2 +W 2 + Z2) 2 + (1V3 + Z 3 ) 2 + YT 3 =i z 



< COS /?, 
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where w\ + w\ + w 3 < nD, x\ = nr sin 2 a and x\ = nr cos 2 a, with r being given by r = ^ , and 
a = arcsin ^ | | x*' | ju | | ) • Equivalently, the false negative event can be rewritten as: 

(xi + y/nvi + zx f ( \— - 1 ] - (x 2 + \/nv 2 + z 2 f - (Vnv 3 + z 3 f 

= (Vnrsin(a) + y/nvi + zxf ( 57^7-! 

\ cos (P) / 



- [Vnrcos(a) + y/nv 2 + z 2 ] 2 - (y/nv 3 + z 3 ) 2 < ^ z 2 = (n - 3)<?, 

j =4 

where = ^ E"=4 • B Y defining 

T x = (V^sina + v x ) 2 ( — \— - 1 ] - (Vrcosa + w 2 ) 2 - w 2 , 
\ cos z p J 

and 

T 2 = -[z 2 + 2zi(Vnrsina + V^wi)] f - 1 ) +z| 

\ cos z p / 

+ 2z 2 [y/nr cos a + V™^] + £3 + 2Vnv 3 z 3 , 
the presentation of the false negative event can be further modified to 

nT Y < (n-3)q + T 2 , 

or equivalently 

nTi T 2 

q > 



(9) 



n — 3 n — 3 

Next, observe that ^^3. ls a y2 ran( j om variable with n — 3 degrees of freedom, i.e., 

/«(«)= Wll^i 6 ' lfq ~° . (10) 

I 0, elsewhere 

By the same token, R = , is also a \ 2 distribution, this time, with n degrees of freedom, and so its density 
is given by 



f R {r) = { 



" . (11) 
0, elsewhere 



Defining ^ = arcsin(< X, u > /||X||), we have (in the absence of a watemark): 

Pf* < a) = 1 - ^ (7r/2 ' a) 
1 " J 2A„(V2) ' 
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from which it follows that the pdf of W is 



and so 



P 



. , , dP(V<a) 2r(f) n _ 2 

Ma) = ^— = v^W) C0S a - 



k/2 4-00 +00 +00 +00 +00 (n—3)/2 

11 •'! /' 1 N 



/ 



a\ \2 



a=-Tv/2r=0 z 3 =-oo z 2 =-oo * 1= -oo g=max ( , -J^) 

z 2 z 2 z 2 

1 /(n- 3)<A -*=p§^ e _i "¥ 



e z 



r (^) V 0% J V^l V^l v 7 ^ 

n / 1 \ 1 / nr 



(f-i) 2 r(^ 

„ 2ct 2 V 2 



e — ^= — T^rrr cos" 2 a ■ dqdz\dz2dz2,drda 



Using the facts that lim n ^ 00 -3^ = Ti and that T 2 grows sublinearly with n, we get 



n — 3 n— 3 

tt/2 +00 +00 +00 +00 +00 

in „ - |T1 

n^oo 7i z z n^oo J1 

a=-7r/2 r=0 z 3 = -oo z 2 = -°o Zi=— oo <j=max(0,Ti) 



lint In P^ n = —— — — — lim — In 



where we used the fact that 



e z e z e z 

v 

V /27r(J I V^TTfli V /2?rCr l 

e "z e z e ' "x e 2 "x x 

e ( n -2)ln(cosa) d(7dzidz2(iz3drda 

(l/2)t n 



lim — In 

n — >oo 



r(n/2) 



(12) 



Finally, by using the saddle-point method B20I . the exponential rate of this multi-dimensional integral is dominated 
by the point at which the integrand is maximum, and we obtain the result asserted in the theorem. This completes 
the proof of Theorem 1 . 

V. The Optimum Watermarking Displacement Signal 

Having derived Ef n as a function of v, we are now ready to derive the main result of this paper, which is the 
optimum embedding function, i.e., the one that maximizes Ef n . 

Theorem 2: The maximum false-negative exponent, Et n , subject to the constraint v\ + v 2 +w| < D, is achieved 

by v* — (vl,V2,v^) where: 



u* = ±y/D -rcos 4 /3, 
v* 2 = — y/rcos 2 (3, 



v* 3 = 0. 
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Proof. Consider first the dependence of Ef n on a. On the one hand, — ln(cosa) is minimized when a = 0. On the 
other hand, T\ also depends on a. Since Ef n is monotonically non-decreasing in T\ and the distortion is insensitive 
to the sign of any component of the watermark, it is seen from eq. (O that the signs v\ and V2 should be such that 
vi sin a > 0, and vi cos a < 0. Therefore Ti(r, a) is even in a, and its minimum is reached at a = 0. This means 
that the minimum of dS) is obtained for a — 0, and then © can be rewritten as 



lim InP 

n— 'CO fi 



In 



1 

mm — 

((j,r)G[max(0,Ti(r-)),oc)x[0,oo) 2 



4- -In 4l-l 
CT z W 



-In 



- 1 



(13) 



" i u x \ u x. 

As the objective function is convex in (r, q), and the global minimum is at (a^, a z ), the minimum in ([T3l would 
vanish if {a 2 z ,a\) € [max(0, Ti(r)), oo) x [0, oo). Otherwise, the minimum lies on the boundary, i.e., it is a point 
of the form (?i(r), r), with r > 0. 

Consider next the optimization of (vi,V2,v 3 ). Observe that the only influence of v on Ef n is via T\. Thus, v 
should be chosen so as to maximize T\, Given that a = 0, Ti can be written as 



\ cos z p J 
which should be maximized over v subject to 



4 < D. 



Obviously any non-zero value of V3, both decreases T\ and reduces the distortion budget remaining for V\ and v%. 
Thus, U3 — 0. Now, T\ is monotonically increasing in v\ , so the maximum must be achieved for v\ + v\ = D, 
which enables to express T\ aj^l 

-1 2 



1 



cos 2 (3 



1 



Equating dT\/dvx to zero and solving for V\, we obtain three solutions: 



vt = 



Uj = —y/D — r cos 4 /? 
«i = \JD — r cos 4 /3 



Considering the second derivative, it is easy to see that for v\ — ±y/ D — r cos 4 [3 one obtains maxima of T\, 
yielding u| = — y^cos 2 /3, and a corresponding value of T\ = Dtan 2 (3 — r sin 2 /3. 



4 Note that two solutions are possible for V2, namely V2 = iy-D — v\. Here we take the negative one, since, as we noted before, v% and 
cosa must have opposite signs and —it/2 < a < n/2, thus cosa is always positive. 
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A. Discussion 

First, observe that the watermarking displacement signal w, and therefore also the watermarked sequence y, lies 
in the plane spanned by the watermark u and the host signal x (a similar conclusion was reached in [15] in the 
attack-free case). This allows to express the optimum watermarking displacement signal, as well as the watermarked 
sequence, as a combination of the host signal and the watermark, leading to the following result: 

Corollary 1: The optimum watermarked signal is given by y = ax + bu, where 

cos 2 (3 

a = 1 , 

cos a 

b = \fr ■ tan a cos 2 (5 ± \J D — r cos 4 (5. 

Proof. From Theorem [2] we have: 

T/i = \frvr sin a± \/ n(D — r cos 4 (3) (14) 
2/2 = Vnr[cos a — cos2 P] ■ 

2 r> 

On the other hand, yo = aJnr cos a, and so, we can conclude that a = 1 — cos . To find b, we use = 

' v ' ' COS OL & l 

a^/nr since + by/n, which when combined with (fT~4T >. gives the value of b is asserted in Corollary 1. This completes 
the proof of Corollary 1. 

It should also be pointed out that the optimum embedding strategy depends neither on a 2 x nor on a 2 z , which is 
the desirable required universality feature. As a consequence, the embedding strategy is the same for the attack-free 
case, studied in detail in Section [VI] 

The geometrical interpretation of the embedding strategy is the following: the embedder devotes part of the allowed 
distortion budget to scale down the host signal, thus reducing its interference, and then injects the remaining energy 
in the direction of the watermark. In fact, this explains why only the component of the watermarked signal in the 
direction of the watermark (i.e., b) depends on D. For illustration, we compare the optimum embedding and the sign- 
embedder introduced in [ 15 1. For the sign embedder, the watermarked signal is given by y se = x+sign(x* • u)V"Du, 
so the watermarking displacement signal can be written as w se = sign(x*-u)v^Du. The two strategies are compared 
in Fig. [H where it is easy to see that the proposed strategy is that of minimizing the embedding distortion necessary 
for obtaining a watermarked signal. It is also interesting to observe that the optimum embedding technique given 
by Theorem 2, could not be described by [16], as in that case the watermarking displacement signal direction is 
just a function of the host signal, and it is scaled for obtaining the desired distortion. 

Another way to look at Theorem 2 is by evaluating a joint condition on the embedding distortion and the false- 
positive exponent (or equivalently on (3) that allows to obtain a false-negative error exponents: if T\ < 0, then the 
optimization in (fT3l l is performed on the region [0, oo) x [0, oo), so any pair (cr^cr^), even with ct| = 0, will 
be in the allowed region, yielding a vanishing error exponent. The condition that permits to avoid this situation is 
r < We can reach the same result by considering the case a = 0, which is the case that captures most of 
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Fig. 1 . Geometrical interpretation of the optimum embedding problem, and comparison between the sign-embedder and the optimum embedder. 
w™ t n and w™ m denote the minimum norm watermarking displacement signals that produce signals in the detection region, for both the optimal 
embedder and the sign embedder, respectively. The corresponding watermarked signals are y™ 1 ™ and y™ m . Furthermore, one can see the 
watermarked signals for the optimal embedder and the sign embedder when part of the embedding distortion can be used to gain some robustness 
to noise (denoted by y£°j? and y r s ° b ). 

probability. In this case, the two components of the watermarked signal y are given by 

yi = ±\/n(D — rcos 4 (3), 
y 2 = Vnr(l - cos 2 /3), 

or equivalently a = 1 — cos 2 (3 and b = ± \J D — r cos 4 (3. Therefore, when D = r cos 2 j3 the watermarked signal 
is the intersection of the boundary of the detection region and the perpendicular vector to that boundary that goes 
through x. On the other hand, when D < r cos 2 (3, even in the noiseless case, one cannot ensure that the embedding 
distortion constraint allows to produce a signal in the detection region, so the embedding function in that case will 
not be so important. In fact, regardless of the embedding function we choose, the false negative error exponent 
would vanish. 

B. False Negative Exponent of the Optimum Embedder 

Having solved the optimum embedding problem, we can compute the false-negative exponent achieved by the 
optimum embedder and compare it to previous results in the literature. To do so, the optimization in (fT~3T > is performed 
over points of the form (Ti(r),r) = (Dtan 2 /3 — rsin 2 /3,r), with < r < co ^ „ . The derivative of ( fl3l ) with 
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respect to r takes the value 



1/1 I 

2 



cos 2 j3 sin j3 



r a\ D — r cos 2 (3 <r| 

which is piecewise convex in (0, D/ cos 2 f3), and (D/ cos 2 /3, oo). Due to the constraints introduced previously, we 
are interested in the minimum in the interval (0, D/ cos 2 (3), which is achieved when 



Da% + 2<t 2 z <Jx cos 2 (3 - Da\ sin 2 (3 



D 2 a% + 4ct|c4 cos 4 (3 - 2D 2 a\a\ sin 2 + D 2 a 4 x sin 4 (3 ) x 



2(cr| cos 2 (3 - a 2 x cos 2 (3 sin 2 (3) 



(15) 



By replacing r with r* in the definition of Ti(r) we get the value of q*, then we insert r* and <j* in (fl3t . and 
finally obtain the optimum error exponent for the AWGN case: 



2DC4 + \l l&a%o\ cos 4 /3 + D 2 [2cr| - cr^l - cos(2/3))] z tan 2 /3 



- 2cr| sin 2 /3 (2cr| + D tan 2 /?) 



[4 (4 - u\ sin 2 , 



E fn - 2 



— - In ( — 



- 1 



~2" ~ ln ( ~T 



- 1 



(16) 
(17) 



Note that due to the choice of U and the symmetry of the Gaussian distribution followed by the host around zero, 
the false-negative error exponent does not depend on the particular choice of the watermark u. 

In Figs. [2] [3] and [4] the behavior of E% n is depicted as a function of various parameters. As expected, the false- 
negative exponent decreases when the false-positive exponent A, the attack variance a 2 z , or the host variance a\, 
increase, while it increases with D. 



C. Numerical Results 

In order to validate the theoretical results with numerical ones, we compare the false-negative exponent with the 
empirical values of — ilnP/ n , for large n. Although large values of n and — ilog(P/ ra ) can not be considered 
simultaneously, due to the resulting very small probability of false negative, in Fig. [5] we can see the similarity 
between Ej n and its empirical approximation when n increases, for different values of a z . Furthermore, in Fig. [6] 
we compare the empirical approximation of the false-negative exponent to its theoretical value for the attack-free 
case (special attention will be paid to this particular case in Section I VP . for different values of A. As expected, the 
larger is A, the smaller is the false-negative exponent. 
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VI. The Attack-Free Case 

As a special case of Theorem 1 and Theorem 2, we calculate the false-negative exponent for the noiseless case 
(<r| = 0). By computing the limit of a\ — ► in ( TT3T >, it is easy to see that in the attack-free case, we have: 



lim r* 

<rl->Q 



D 



D 



-2Do% sin 2 (3 

2a% cos 2 (3 sin 2 (3 cos 2 f3 1-e 



-2A 



To compute lim CT 2 ^ from ( TToT l we can use L'Hopital's rule. Given that 

z & z 



2L>cr| + a/ 16erf er 4 , cos 4 /3 + D 2 [2a% - a 2 x {l - cos(2^))] z tan 2 /3 



-2a\ sin 2 (3 (2a\ + £> tan 2 /?) 



= -4fT^ sin 2 /3, 



and 



(18) 



lim 



da% 



a\ [4 (a\ - a\ sin 2 /?)] = -4a| sin 2 /?, 
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we conclude that 



lira —5- = 1. 



(19) 



From ( fT8l ) and ( fT9b , it is straightforward to see that the value of the false-negative exponent for the attack-free 
case is given by 

1 if D 



lim £ 



D 



ln 



- 1 



elsewhere 



(20) 



In view of d20l i. it is interesting to note that as long as D > a 2 x , Ej n > for any A. In fact, under these conditions, 
the asymptotic value of Ef n when A — > 00 is 













- 1 









(21) 



coinciding with the result of [2. Corollary 1]. 

On the other hand, when D < <j 2 x another interesting point which reflects the goodness of the proposed strategy 
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Fig. 4. False negative error exponent as a function of ax, for several embedding distortions, a z = 1 and A = 0.1. 



is the computation of the range of values of A where Ef n > can be achieved. In this case, the condition to be 
verified is 

implying that 

\<-^ln(l- -^-J = Ai, for D<a 2 x , (23) 

whereas for the sign embedder [15], the values of A for which Ef n > are those such that 

D 1 - e~ 2A 

^ > -1=53^' (24) 

"x e 



or, equivalently, 



* < -I ln ( r^ UX 9 ) = A 2, for all D. (25) 



2 V D + a\ 



Given that Ai > A2, larger values of false positive error exponents are allowed (while still keeping Ef n > 0) by 
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Fig. 5. Theoretical false negative error exponent and — i log(P/„) as a function of the number of dimensions n. D = 2, ct^ = 1 and 
A = 0.6, and equal to 0.52, 0.53, 0.54 and 0.55, respectively (from top to bottom). 



the new embedding rule. In Figure [7] we compare the bounds on the false-negative exponent for the attack-free 
case found in |15|, with its optimal value derived here. As can be seen, the improvement owing to the optimum 
embedding strategy is significant, especially for small A. 

VII. Conclusions 

We derived a Neyman-Pearson asymptotically optimum one-bit watermarking scheme in the Gaussian setting, 
when the detector is limited to base its decisions on second order empirical statistics only. The scenario we considered 
is universal in the sense that the variance of both the host signal and the attack are not known to the embedder and 
to the detector. Our main results are simple closed-form formulas for both the optimum embedding function and 
the corresponding error exponents. The noiseless scenario can be seen as a special case, where we can compare 
the false-negative exponent achieved by the optimum scheme with the bounds derived in fl5l . Interestingly, the 
optimum embedder is very simple thus opening the door to practical implementations. 

This work can be extended in many interesting directions, including non-Gaussian settings, more complicated 
attacks, like de-synchronization attacks ll2Tl . Il22l . more detailed empirical statistics gathred by the detector, and 
the introduction of security considerations in the picture [23|. 
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